Towards phone segmentation for concatenative speech synthesis

Authors

  • Jordi Adell
  • Antonio Bonafonte
Abstract

We present a new approach to phone segmentation for preparing databases for concatenative Text-to-Speech synthesis. First, we describe the problem and review the state of the art. Then we present some existing segmentation techniques and our approach, which uses a Regression Tree to perform Boundary Specific Correction of the HMM segmentation. We discuss different evaluation procedures. Finally, we compare several systems and show that our system improves on the HMM-based system, placing 94% of the boundaries within a tolerance of 20 ms of a manual segmentation, and that phonetic rather than acoustic features are better suited to this task.
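The figure quoted in the abstract (boundaries within a 20 ms tolerance of a manual segmentation) can be illustrated with a minimal sketch. The function name `boundary_accuracy` and the sample boundary times are hypothetical and not taken from the paper. The sketch also assumes that automatic and manual boundaries are already paired one-to-one, in seconds.

```python
from typing import Sequence


def boundary_accuracy(
    auto_s: Sequence[float],
    manual_s: Sequence[float],
    tolerance_s: float = 0.020,
) -> float:
    """Return the fraction of automatic boundaries that lie within
    tolerance_s seconds of the corresponding manual boundary.

    Assumes both sequences list the same boundaries in the same order.
    """
    if len(auto_s) != len(manual_s):
        raise ValueError("boundary lists must have the same length")
    if len(manual_s) == 0:
        raise ValueError("at least one boundary is required")
    hits = sum(
        1 for a, m in zip(auto_s, manual_s) if abs(a - m) <= tolerance_s
    )
    return hits / len(manual_s)


# Hypothetical boundary times in seconds (illustrative values only).
manual = [0.100, 0.250, 0.410, 0.600]
auto = [0.105, 0.275, 0.400, 0.615]

# Deviations are 5, 25, 10 and 15 ms, so three of four boundaries
# fall within the 20 ms tolerance.
accuracy = boundary_accuracy(auto, manual)
print(f"{accuracy:.0%} of boundaries within 20 ms")
```

A fixed tolerance makes systems easy to compare with a single number, but it hides how far the remaining boundaries are from the reference. This may be one reason the abstract mentions several evaluation procedures.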

Similar resources

Choose the best to modify the least: a new generation concatenative synthesis system

The paper describes a corpus-based approach applied in the evolution of ELOQUENS, the CSELT text-to-speech synthesis system for Italian, towards multi-voice, multilanguage, high-naturalness concatenative synthesis. The acoustic modules have been redesigned, according to the idea of reducing the number of junctions and the need of prosodic modification. Appropriate phonetic coverage methods were...

Concatenative speech synthesis for European Portuguese

This paper describes our ongoing work in the area of text-to-speech synthesis, specifically on concatenative techniques. Our preliminary work consisted of investigating the current trends in concatenative synthesis and the problems that could arise when we apply the existing state-of-the-art solutions to the specific case of European Portuguese. Our ultimate goal is to develop a text-to-speech ...

Automatic error detection in alignments for speech synthesis

The phonetic segmentation of recorded speech is a crucial factor in the quality of concatenative systems for speech synthesis. We describe a likelihood-based error detection process that can be used to flag possible errors in such a segmentation, with a view towards manual correction. It is shown that this process can be used to assist in the creation of high-accuracy segmentations. In partic...

Syllable Specific Unit Selection Cost Function Using a Tone Modeling Technique for Automatic Phonetic Segmentation of Hindi Speech Using HMM

This paper presents a technique for improving tone correctness in speech synthesis of a tonal language, based on an average-voice model trained with a corpus of nonprofessional speakers' speech. Unit selection-based concatenative synthesis is one of the most widely used speech synthesis approaches. This approach overcomes the limitations of other synthesis techniques such as articulatory synthesis an...

An artificial intelligence approach to concatenative sound synthesis

Content Overview; List of Figures; List of Tables; List of Abbreviations; Acknowledgments; Author’s Declaration. Chapter 1: Introduction (1.1 Motivation; 1.2 Introduction; 1.3 Objectives; 1.4 Thesis Structure). Chapter 2: Principles of Concatenative Sound Synthesis (2.1 Sound Synthesis; 2.1.1 Rule-based Model; 2.1.2 Data-driven Model; 2.2 Sub...

Publication date: 2004